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SEPARATION WITH RESTRICTED FAMILIES OF SETS 


ZSOLT LANGI, MARTON NASZODI, JANOS PACK, GABOR TARDOS, AND GEZA TOTH 

Abstract. Given a finite n-element set X, a family of subsets C 2^ is said to separate 
X if any two elements of X are separated by at least one member of T. It is shown that if 
jj"! > 2"“^, then one can select [logn] + 1 members of X that separate X. If |J^| > q; 2” 
for some 0 < a < 1/2, then logn + 0(log ^ log log i) members of X are always sufficient to 
separate all pairs of elements of X that are separated by some member of T. This result is 
generalized to simultaneous separation in several sets. Analogous questions on separation 
by families of bounded Vapnik-Chervonenkis dimension and separation of point sets in 
by convex sets are also considered. 


1. Introduction 

For a set A, we say that a subset of X separates two elements if it contains one of them 
and does not contain the other. For a family X of subsets of A, we say that it separates a 
pair of elements of A if at least one member of X separates them. Furthermore, A separates 
X if every pair of distinct elements of A is separated by A. 

Suppose your computer is infected by a virus x E X, where A is the set of known computer 
viruses. You want to perform a number of tests to hnd out which virus it is. Each test detects 
a certain set of viruses, which can be associated with the test. Let A denote the family of 
subsets of A associated with the tests you can perform. These tests are sufficient to identify 
the virus if, and only if A separates A. The number of tests necessary is at least log |A|, 
where log stands for the base 2 logarithm. On the other hand, there is a family A C 2^ 
with |A| < [log|A|] that separates A. This is the starting point of a rich discipline called 
combinatorial search theory; see |AhlW87] . 

Any hxed pair of distinct elements in A is separated by subsets of A, thus a family 

A with |A| > separates A. Our Erst theorem states that in this case, even a small 

subfamily of A does the job. 

Theorem 1. Let X he a finite set, A C 2^^^ with |A| > Then X can be separated by 

a subfamily Q ^ X of cardinality at most flog |A|] + 1. 
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This statement is almost tight, but not completely. Indeed, for |X| = 5, Theorem [T] 
guarantees the existence of a 4-member separating family, but it is easy to verify that 3 sets 
suffice. In the following generalization we give the best possible bound. 

Theorem 2. Let Xi,he pairwise disjoint sets with |Xj| < n for i = 1,2,... ,k. Let 
X = lJi=i^T ^ 2^ satisfies \jF\ > then T has a subfamily of eardinality at 

most [logn] -|- 1 that separates X^ for every i {i = 1,2,..., k). 

The above bound is tight, that is, the same statement is false for every n>2, if we replace 
[logn] + 1 by [logn]. 

We call the density of X. If this slips below 1/2, we cannot guarantee the 

existence of a small subfamily separating X, as X itself does not necessarily separate X. 
But, as claimed by the next theorem, we can still find a small subfamily separating all pairs 
in X that X separates. We state this result for simultaneous separation. 

Theorem 3. Let Xi,... ,Xk be disjoint sets with \Xi\ < n for alii = 1,2,..., k. Let 
X = U^=i X 2^. Then X has a subfamily of size at most flog n~\-\-C log ^ log log ^ 

separating every pair in each Xi that is separated by X. Here a = is the density of 

X and C is a universal constant. 

Definition 1. Let X be a set of n elements. We call a pair (V, W) of disjoint subsets of X 
a constraint, 1/ U IT is the support and \V UW\ is the size of the constraint. A subset A 
of X satisfies the constraint (V, IT) if T C A and W O A = (j. A family X of subsets of X 
satisfies a constraint if the constraint is satisfied by some member of X. 

Note that the fact that two elements x,y ^ X are separated by a member of X means 
that at least one of the constraints ({a:}, {?/}) and ({?/}, {x}) is satisfied by X. 

Given a family X 2^ and a family of constraints satisfied by X, we are looking for a 
small subfamily of X that also satisfies the given constraints. The next theorem establishes 
what the density of X has to be (depending on the size of the constraints) for this to be 
possible. 

Theorem 4. For a positive integer m and 1 — < a < 1 there exists a constant 0 < 

c = c(m, a) with the following property. If X is a finite set, X is a family of subsets of X 
with density above a and C is a collection of N constraints, each of size m and satisfied by 
X, then there is a subfamily consisting of at most c\ogN sets from X that satisfies all the 
constraints in C. 

A similar statement is false for any m> 1 and a = 1 — . 

Up to this point, the ground set X and the family X were not assumed to possess any 
structure. The first such assumption we make is that of a bounded VC-dimension and prove 
a linear lower bound on the size of any separating subfamily in this case. Given a family 
X of sets, the Vapnik-Chervonenkis dimension (in short, VG-dimension) of X is the largest 
integer d for which there exists a d-element set A such that for every subset BOA, some 
member F of X has F n A = B. 

Let us fix d and assume a set of size n is separated by a family X of VG-dimension d. In 
this case |X| must be at least polynomial in n, namely \X\ > This follows from 
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the Shatter Function Lemma (or Sauer-Shelah Lemma, cf. Lemma 10.2.5 in |Mat02j ) and 
the fact that the dual of VC-dimension d family has VC-dimension below The size of 

a separator family of fixed VC-dimension d can, indeed, be a small polynomial of n, namely 
\J^\ < 2 ‘^nC (2 -1) can be obtained by considering the dual of the set system ( 2 / 1 )- 

We can show even stronger, linear lower bound on the size of a separating family if the 
base set to be separated can be arbitrarily chosen from an infinite universe with a bounded 
VC-dimension set system. 

Theorem 5. Let U be an infinite set and C 2^ a family of subsets of U of Vapnik- 
Chervonenkis dimension d. For every n > 0, there is a set X <zU with |X| = n such that 
any subfamily of iF which separates X, has at least members. 

Note that this theorem is almost tight. Let U be the set of positive integers and let X 
consist of the subsets of U containing 1 and separating at most d pairs {i,i + 1). The VC- 
dimension of is d and any n element subset of U can be separated by {n — l)/d + \ogd 
members of X. 

Now, we turn to separation problems where X and X have some geometric structure. 

Definition 2. A set of points in is said to be in general position if no d -|- 1 points lie on 
a (d — l)-dimensional affine subspace. 

A prototype of separation questions in the geometric setting was hrst studied by Gerbner 
and Toth |GT13j . They showed that for any set X of n points in general position in the 
plane, there is a family X of at most 20nloglogn/logn convex sets that separates X. On 
the other hand, there is a set X of n points in the plane in general position for which any 
separator of X, consisting of convex sets, has cardinality at least n/(21ogn -|- 2). 

It is natural to ask how these results about separating pairs of points could be extended 
to separation of pairs of fc-tuples of points. 

Definition 3. Let X be a set and k a positive integer with k <\X\ — 1. We say that a set 
F <Z X containment-separates a pair of fc-element subsets of X if F contains one of them 
and does not contain the other. A family X of subsets of X is a containment-separator of 
fc-subsets of X if for any two fc-subsets A and F of X there is at least one member of X that 
separates them. 


Our goal is to select a small subfamily of X that containment-separates the fc-subsets of X. 
As a generalization of the question discussed in |GT13] . we denote by c^{n) the minimum 
number c such that, for any set X of n points in general position in d-space, there is a 
containment-separator of fc-subsets of X which consists of c convex sets. This makes sense 
for fc < d -|- 1 and we have cf.{n) < (^) because the convex hulls of the fc-subsets of X 
containment-separate the fc-subsets. However, two /c-subsets with the same convex hull are 
not containment-separated by any convex set. Thus, c^(n) does not exist if d -|- 2 < A; < n. 

The result quoted above from |GT13] can be stated as 

n/(2 logn 2) < c^(n) < 20n log log n/ logn. 


Regarding higher dimensional point sets, it is observed in |GT13] that for any n and d. 
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Our goal is to find bounds on c^(n) for k >2. 
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It is a challenging problem to narrow the gap between the two bounds in (|1]) . Theorems [2] 
and [3] are shown in Section [2l Theorem H] in Section [3l Theorem |5] in Section |H Theorem [6] 
is proved in Sections [5l We briefly discuss intersection-separation, a relative of containment- 
separation, in Sectional 


2. Proofs of Theorems [2] and [3] 

Proof of Theorem [H We replaced our original proof of the hrst part of this theorem by a 
more elegant argument of Andras Meszaros [M14j . which was submitted as a solution to our 
problem at the Miklos Schweitzer competition in 2014. 

We regard P = 2^ as a vector held over F 2 with respect to the symmetric difference of 
sets, which we denote by As \Xi\ < n for all i we can hnd a family U of at most [logn] 
subsets of X (not necessarily in X) that separates each Xj. Let W denote the linear subspace 
of V spanned by U. The translates of W partition V, and thus, there is a translate W + c, 
more than half of whose members are in X. Pick a d G iW + c) H X and consider the set 
{x E W ■. X + d E X}. This set has cardinality larger than |lP|/2, so it spans IP, and 
thus, it contains a basis Z of W. Now, \Z\ = dim IP < \U\ < [logn]. We claim that the 
set S = {d} U {z + d ■. z E Z} separates each Xj. To see this, observe that for any pair of 
elements x,y E X, the set of those elements of V that do not separate x and y form a linear 
subspace of V, so if S did not separate x and y neither did any set generated by S. But this 
is not possible for x,y E Xi as S generates each element z of Z through z = d + {z + d) and 
thus S also generates the subspace IP including the sets in U, one of which separates x from 
y. This hnishes the proof of the hrst part of the theorem. 

Now we prove the second part. Let X be the union ofk = 2"'“^, pairwise disjoint n-element 
sets, let X consist of the elements of 2^ that intersect at least one class Xj in 0 or Xj. Then 
2^ \X has (2"' — 2 Y elements, and 

|2^ \X| _ / 1 Y"”' 

Hence, X contains more than half of the elements of X. On the other hand, assume that 
Q d X separates X, and let G E Q. Let Xj be a class in X which is intersected by G either 
in 0 or in Xj. Clearly, to separate Xj, we need at least [logn] more elements of X, which 
implies the assertion. □ 











SEPARATION WITH RESTRICTED FAMILIES OF SETS 


5 


Before presenting the details of the proof of Theorem [3] we introduce some notation and 
give a sketch of the proof. 

In the setup of both Theorems [2] and [3] we are given a base set X partitioned into the 
parts Xi of size at most n and we want to rehne this partition by selecting a small subset of 
a family C 2^. In the proof of the latter theorem we will select this separating subfamily 
in phases. After selecting a subfamily C we measure the progress by the maximal size 
m of a yet un-separated part of some Xj. Clearly, we need at least log(n/m) sets to partition 
a set of size n to sets of size m or smaller. Accordingly, we call the quantity |Xi| — log(n/m) 
the loss incurred in decreasing the part size from n to m. 

In this terminology we can phrase Theorem [2] as stating that if the density of X is above 
1/2, then we can decrease the part size all the way to 1 with a loss of less than 2. Similarly, 
Theorem [3] states that if X has density a and separates all Xj, then we can decrease the part 
size to 1 with a loss of 0(log(l/a) loglog(l/a)). Note that a loss of log(l/a) is unavoidable 
in certain cases, for example if we have a small set X C Xi and X consists of all sets disjoint 
from X, containing X, or having size 1. We are not sure if the loglog(l/a) factor is needed. 

We will prove Theorem [3] by constructing the separating family in phases. The hrst stage 
will reduce the part size to at most 1/a for a loss of less than 2. This stage is a generalization 
of Theorem [2] and proved very similarly. 

In the second phase we further reduce the part size to 0(log(l/a)) for a loss of 0(log(l/a)). 
We will select the separating sets in this phase one by one, but we remark that selecting 
them at once one can decrease the loss incurred in this phase to a constant for the small 
price of reducing the part size to 0(log^(l/a)) or even 0(log(l/a) loglog(l/a)) (instead of 
0(log(l/a)) as presented here). Unfortunately, the loss incurred in the third phase is much 
larger and dominates the losses in the other phases. 

In the third phase we make sure that all but (9(log(l/a)) elements of X form singleton 
parts, while in the hnal fourth phase we separate the remaining few non-singleton parts to 
“atomic” parts not even separated by X. 

Let A and X be hnite sets. We say that A cuts X well if |X|/4 < |X n A| < 3|X|/4. In 
the second phase the following trivial observation is going to be useful; 

Lemma 1. Let X be a hnite set of size m > 2. The density a of the subsets of X that do 
not cut X well satishes a < 1/2 and a < 

In the third phase we use the following result of Brace and Daykin [BD71j . 

Theorem 7 (Brace and Daykin, 1971). Let t > 1 be an integer, Y be a set of size s and X' 
he a subset of 2^ of density exceeding {t + 2)/2*~^^. If[jX' = Y, then there are t elements 
of X' whose union is also X. 

Proof of Theorem 0 In the hrst phase of the selection of the separating subfamily of X we 
decrease the maximal size of a part from n to at most 1/a. In case n < 1/a one can simply 
skip this phase. 

We mimic the proof of Theorem [2l We assume without loss of generality, that n = 
maxj |Xj| and set r = flogn]. We let X = be the r-dimensional vector space over the 
two element held with the usual inner product and choose / : X —)■ X that is injective 
on each X,. We set W = {O^ | x G X}, where consists of the elements a of X with 
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/(a) not orthogonal to x. We regard 2^ as a group with the symmetric difference operation 
(denoted by +) and note that W is a subgroup. As the density of is a we can choose 
C ^ T such that the density of within the coset (7 + W is at least a, that is the set 
S = {x ^ V : C + Ox ^ -A} satisfies 151 > a2^. 

With a < 1/2 the set S does not necessarily generate the whole of V. But we can still 
choose a basis i? C 5 for the subspace of V generated by S and it is easy to see that the set 
= {C \ Ox |xG5}U{C}of size |i?| + 1 separates each W into parts of size at most 
2 ^-l'S| < 1/a. Thus, we have decreased the part size to at most 1/a for a loss of less than 2. 
We have \J^i\ < logn + 2. 

In the second phase we do similarly as in the first phase but we do not have a linear 
structure on V any more. Let m be the maximal size of a part in the current partition of 
We take V to be an m element set and select a function f : X ^ V that is injective in 
every part. We set W = {f~^{H) \ H <ZV}. As before, W is a subgroup of 2^ (considered 
with the symmetric difference). We select C E X such that the density of X in the coset 
C + W is at least a, that is, the set S = {H \ C + G X} satisfies |5| > a2™'. 

In case S contains a set H that cuts V well, we choose one such set and include the sets C 
and C + in our separating family. This makes all the parts in the current partition 

be at most 3m/4. 

By Lemma[T]the selection of the set H E S cutting V well is possible as long as a > 
that is m > 101og(l/a). In the second phase we repeat the above procedure till all parts 
of the current partition is of size at most 101 og(l/a) and call fhe set of separating sets 
collected. As the maximal size of a part was at most 1 /a in the beginning of phase two, and 
this maximal size decreases by a constant factor in each round when we add two sets to X 2 
we have |J^ 2 | = 0 (log(l/a)). 

In the third phase we use a different strategy. Let {X' | z G /} be the set of parts in the 
current partition of X. We call a part X[ good if there is a set A G X cutting Xi well and 
we call XI bad if it has at least 2 elements, but it is not good. Let G = {z G / | X' is good}. 
For i E G we select a set Bi C 2^* of size \Bi\ = 2 Ail-i such that all sets in Bi cuts X' well 
and we have a set A E X with A n X' G Bi. This is possible as by Lemma [T] at least half 
the subsets of X' cut X' well. 

We define the function / : 2^ —>■ 2^^ by setting /(A) = {z G G \ A fl X■ G Bi]. Let 
X' = f{X). Clearly, / takes all values an equal number of times, thus the density of X' 
in 2^ is at least the density a of X in 2^. Note that for all z G G we have A E X with 
z G /(A), thus we have IJX' = G. We choose t = 0(log(l/a)) such that (t + 2)/2*+^ < a 
and apply Theorem [7] to find t sets X,... E X' with IjLi ~ Ai E X 

with /(Aj) = Jj and include these t sets in our partitioning family. Note that if x G X is 
contained in a good part X' of size m, then the size of the part containing x after considering 
these t new separating sets is at most 3m/4. 

We repeat the above procedure for [log(10 log(l/a))/log(4/3)] = 0(loglog(l/a)) times 
and obtain X 3 as the union of all the elements of X we selected. We clearly have IX 3 I = 
0 (log(l/a)loglog(l/a)). 

We call X E X fully separated if it forms a singleton part in the current partition after 
the third phase. Clearly, if x is not fully separated, it must have been in a bad part at 
some time. Let us choose the earliest bad part containing x and consider all the distinct 
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sets Yi,... ,Yj obtained this way from not fnlly separated elements. Clearly, these sets are 
pairwise disjoint. Let Y = By Lemma [H the ratio of snbsets of X not cntting Yi 

well is at most A random snbset A Y X intersects the sets Yi independently, thus 

the probability that it cuts none of the Yi well is at most By the 

definition of bad parts, no set A G cuts any of the sets Yi well, so we have a < 2“I^IAo_ 
Therefore, all non-singleton parts in the current partition after the third phase is contained 
in the set Y of size at most 101og(l/a). 

Finally in the last phase we select any set in X that refines our current partition and 
repeat this process until no further refinement is possible. Clearly, the set selected in this 
phase satishes \Xi\ < |F| = 0(log(l/Q!)). 

The family XiU X 2 A X^^ separates any pair of elements in the same part Xi that is 
separated by X and the size of this set is logn -|- 0(log(l/a) loglog(l/a)). This finishes the 
proof of Theorem |3l □ 


3. Proof of Theorem 0] 

For 0 < £ < 1, we call a constraint e-good if at least e\X\ members of X satisfy it, and 
e-bad otherwise. 

The proof of the hrst statement of Theorem 0] consists of two steps. First, a standard 
application of the probabilistic method shows that for any e > 0, all £-good constraints can 
be satisfied by (logA^)/e randomly chosen members of X. Second, we show that if e is set 
sufficiently small as a function of m and a but independent of |X| and N, then the number 
of e-bad constraints is bounded by another value Z depending on m and a, and independent 
of |X| and N. Since all constraints are satisfiable, it means that adding Z (well chosen) 
members of X to the (log A^)/e random members satisfying the e-good constraints we obtain 
a collection satisfying all the N constraints. 

To prove the first step, let C be a e-good constraint, and choose (logA^)/c members of X 
randomly, uniformly. 

log ^ f 

P(C' is not satisfied by any of the chosen sets) < (1 — c) = < —. 

Thus, with non-zero probability, all the at most N e-good constraints are satisfied by the 
randomly chosen members of X. 

For the second step let a be a large enough number depending on a and m to be set shortly. 
Assume that the supports of a2"* distinct e-bad constraints form a “sunflower”, that is any 
two of them intersect in the same set C. Let b = \C\ and note that 0 < 6 < m — 1. Clearly, 
we can hnd a of these constraints, say (Id, Wi) for 1 < i < a that contain the elements of 
C “on the same side”, that is C is the support of a constraint (Vb, hFo) with Vq Y Vi and 
Wo C IFj for i = 1, ..., a. Now consider a uniform random subset A C X. This set satisfies 
the constraint (Vb, hFo) with probability 2“^. If it satisfies (Vb,lLd), then the conditional 
probability that it also satisfies (Id, Wi) is 2^“™ for any i > 1. From the sunflower property 
we see that assuming A satisfies (Vq, Wq) the events that A satisfy (Id, Wi) are mutually 
independent, so the overall probability P that A satisfies at least one of them is exactly 

F = 2-^(1-(1-2^-"*)“). 
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On the other hand, A has the chance at least a to be in and assuming it is in it has 
a chance of less than e to satisfy any of those constraints. Therefore we have 

P < (1 — a) + aae. 

Let us select £ > 0 small enough and a large enough (depending on m and a) such that 

2-\l - (1 - 2'’-™)“) > (1 - a) + aae 

holds for any 0 < 6 < m — 1. This is possible as increasing a the left hand side approaches 
2 -fe > 2^“™ and for hxed a, and £ approaching 0, the right hand side approaches 1—a < 2^“™. 

With this choice of e and a the imminent contradiction in the last three displayed equations 
shows that the supports of no 02™ £-bad constraints form a sunflower. By the sunflower 
lemma of Erdos and Rado |ER60] we hnd that the total number of e-bad constraints is at 
most Z = m!2”^(a2™)™'. (The extra 2”* factor is coming from the possibility that many 
e-bad constraints may have the same support.) This bound is independent of N and |X| as 
claimed and hnishes the proof of the hrst statement of Theorem 01 

For the second part, for any m and N we construct a base set X, a family P of density 
strictly above 1 — 2^“"* and X one sided constraints {Vi, 0) of size m, all satished by exactly 
one member of the family P and such that different constraints are satished by different 
members of P. Thus, all constraints are satished by P but no subset of cardinality less than 
N satishes them all. 

For this we set |X| = X -|- m — 1 and identify a subset X C X of size |X| = m — 1. Let P 
consist of all the subsets of X not containing Y plus all the m element subsets. We select 
the constraints {Vi, 0) with all possible m-subsets Vi oi X containing Y. 

4. Proof of Theorem [5] 

Assume without loss of generality that U is the set of positive integers. Since the VC- 
dimension of P is d, for any {d + l)-element subset A = {oi, 02 ,..., 0 ^+ 1 } C U with oi < 
02 < ... < ttd+i, there is a set T{A) C {1, 2 ,..., d-|- 1} such that no F E P satishes 
F r\ A = {ai\i G T{A)}. If there is more than one such set, we hx T{A) arbitrarily, and we 
call it the type of A. 

By Ramsey’s Theorem, there is a set X = {yi,y 2 , ■ ■ ■ 2/n} C U with yi < ... < yn, such 
that all {d + l)-element subsets of X have the same type T. 

Let us call the index 1 < i < d regular if T separates i from i + 1 and singular otherwise. 
We let k stand for the number of singular indices. 

Consider any F E P. We claim that there is a set F* of at most k elements of X and a 
partition of X into at most d — k + 1 intervals such that the symmetric diherence F + F* 
does not separate any two elements in the same interval. To see this consider the greedy 
process looking for indices A < 12 <■■■ < id+i such that for the set FI = {yi ^,..., yi^_^_^} we 
have H n F = {y^. | j G T}. As such a set could not have type T, we cannot hnd all these 
indices, nevertheless, some of the indices can be found by a greedy process (for example, if 
1 G T, then we start with R = min{j \ yj E F}). We can satisfy the claim by making F* 
consist of the elements yi^, where j is singular and starting a new interval in the partition 
of X at every element yi^ with j regular. 




SEPARATION WITH RESTRICTED FAMILIES OF SETS 


9 


Now assume the family {Fi,..., F^} F F separates X. Let F* be the corresponding sets 
of size at most k and consider the set Id = X \ IJ^;^ F*. We have |ld| >n — mk determining 
at least n — mk — 1 neighboring pairs of elements. By the above claim each set F* separates 
at most d — k oi these neighboring elements, thus we must have n — mk — 1 < m{d — k), 
that is, m > {n — l)/k. This completes the proof of Theorem [5l 

5. Containment-separation 

In this section we prove Theorem O 

We start with the proof of ([I]). Let H = {xi,..., be an arbitrary n element subset of 
the real line and assume xi < ... < x„. Note that the family {[xi, Xj] | 1 < i < n} U{[xi, Xn] \ 
1 < i < n} of 2n — 4 convex sets containment-separates all the pairs in H. This proves 
clin) < 2n — 4. On the other hand realize that no convex set containment-separates more 
than one of the 2n — 4 pairs ({xi, x*}, {xi, Xj+i}) for 1 < i < n and ({xj, x„}, {xj_i, x„}) for 
1 < i < n. This proves cl{n) > 2n — 4. 

We will use the following monotonicity property: 

4{n)<ct-\n), 

for all choices oi k < d and n. To see this consider a set X of n points in general position in 
d-space and hnd a projection tt to (d — l)-space such that vr(X) is again n points in general 
position. A generic projection tt satishes this. Now hnd 4~^{n) convex sets to containment 
separate all the fc-sets of vr(X) and consider the inverse images of these sets for the projection 
TT. Clearly, these sets are convex and they containment-separate the fc-subsets of X. 

A similar monotonicity also holds in A: if n > fc -|- 1: 

4{n) < 4+iin)- 

This is because any collection of sets containment-separating the {k -|- l)-subsets of an n-set 
H also containment-separate the fc-subsets. Indeed, if A and B are fc-subsets, then any set 
containment separating A U {x} from B U {x} also containment-separates A from B. This 
trick works if we can choose x E H outside AUB. In case AUB = H we pick x E A\B and 
y E B\A and use that any set containment-separating A\J{y} from FU{x} also containment 
separates A from B. This latter trick fails if |AnF| = k — 1, but then |AUF| = fc-|-l<n. 

Next we prove ([2]). The upper bound follows from ([1]) via the monotonicity mentioned 
above. 

For the lower bound in ([2]) let p = (1,1) and consider the set X consisting of p and n — 1 
points on the unit circle around the origin, all in the hrst quadrant. This set is in general 
position. Let us denote the points of X on the circle Xi,... ,Xn-i in order of increasing x- 
coordinate. The property of the arrangement we use is that the the convex hull of {x*, Xj,p} 
contains all Xk with i < k < j. This implies that any convex set can containment-separate 
at most two of the pairs of two element sets ({xj,p}, {xi+i,p}) for 1 < i < n — 1 and the 
stated lower bound on c^n) follows. 

Next we turn to the upper bound in (jTj). By the monotonicity mentioned above it is 
enough for us to prove Let us consider a set X of n points in general 

position in the {k — l)-dimensional space. Each {k — l)-subset of X determines a hyperplane. 
Consider all the closed half-spaces bounded by one of these hyperplanes. This is a collection 
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of convex sets and it containment-separates all fc-subsets of X. Indeed, if A and B are 

distinct fc-subsets of X, then either the convex hull of A does not contain B, or vice versa. 
In the former case a supporting half-space of the convex hull of A containment-separates the 
sets. 

We turn to the proof of ([3]). The upper bound is a special case of the upper bound in (|1]). 
For the lower bound we give a construction. 

Let us assume n > 8. Let Xex be the vertex set of a regular k-gon around the origin for 
k = 2[n/4j. We call the opposite pairs of points in Xex a diameter. Let us hnd a point x 
distinct from the origin but so close to it that it is contained in the interior of the convex 
hull of any two diameters. We further assume that Xex U {x} is in general position. Let 
I = n — k and Xjn = {xj | 1 < i < /}, where x* = |x. 

Consider the pairs of 3 element sets {{p, —p, Xj}, {p, —p, Xj+i}}, where p G Xex (so {p, —p} 
is a diagonal) and 1 < i < / — 1. No set containing no diagonal can containment-separate 
any of these pairs, and convex sets containing more than one diagonal contain all x* in their 
interior, so they do not separate these pairs either. A convex set S containing a single 
diagonal {p, —p} separates at most one of these pairs, since, if x, G S, then we also have 
Xj G S for all j < i. 

This shows that Xex U Xin is a good choice for a hard to separate set, but it is not in 
general position. Fortunately, the above arguments are robust against small perturbations. 
Let us obtain X as the union of Xgx and a set {x' | 1 < i < n} where x' is £-close to Xj but 
perturbed in such a way that X is in general position. 

It is easy to see that if £ > 0 is small enough, then each convex set can containment- 
separate at most one of the pairs ({p, —P, x'}, {p, —p, x'_,_]^}. To containment-separate all 
these pairs, one needs at least (Z — l)k/2 convex sets, proving the lower bound in ([3]). 

Finally, we prove the lower bound in (jl]). For this we need the following result. For any 
n > d > 2, Karolyi and Valtr |KV03] constructed a set of n points in the d-dimensional 
space which contains at most Cd log*^”^ n points in convex position, where q depends only 
on d. We call such a point set a Karolyi- Valtr construction. 

Let us assume that k is odd as the case of even k comes from the monotonicity of c^(n) in 
k. We need to construct a set X of n points in general position in d-space, whose fc-subsets 
is hard to containment-separate with convex sets. Let m = Take a set A of size ^ 

and a set A' of size m such that all the ^ + m points m. AU A' are at unit distance from 
the origin and the unit vectors corresponding to any d of them are linearly independent. 
Let Xex = —A U A, where —A = {—p | p G A}. We call the points p and —p in Xex an 
opposite pair. Let F be an m-flat passing through the origin and the points of A'. Consider 
a ball B centered at the origin that is so small that the convex hull of any d opposite pairs 
of Xex contains B in its interior. Let Xin be an m-dimensional Karolyi-Valtr construction 
of ^ points in F n B and assume (without loss of generality) that Xjn as an m-dimensional 
set is in general position. (Note, however, that neither Xgx nor X[^ is in general position in 
d-space.) 

We claim that if a convex set S contains opposite pairs, then the points of X^ on the 
boundary of S are in convex position. Indeed, otherwise there would be a set id of m -|- 1 
points in Xjn, all on the boundary of S and forming a simplex in F such that the simplex 
contains yet another point x G Xin on the boundary of S. The contradiction comes from the 
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fact that in this case the convex hull of the union of H and the opposite pairs in S has x in 
its interior. 

As a consequence, we see that if a convex set contains opposite pairs, then its boundary 
contains at most Cm log™”^ n points of Xi^- 

Set £ > 0 very small and let X be a point set consisting of a point e-close to each point 
in Xex and two distinct points e-close to each point in We choose the points in X to be 
in general position and let / : Xgx —)■ X, /i : Xjn —)■ X and /2 : Xjn —)■ X be the functions 
showing our choices. 

Consider a set H of opposite pairs, and a point x G Xjn and let P{H, x) be the pair 
{f{H)U{fi{x)}, f{H)\J{f2{x)}). We claim that if e is small enough, then any convex set can 
containment-separate at most Cmlog™”^!! pairs P{H,x) with a £x H. Indeed, to separate 
the pair P{H,x) the convex set must contain f{H) and has to have a boundary point on 
the (short) interval fi{x)f2{x). In the limit for £ ^ 0 we find a convex set containing H and 
having x on its boundary. As we saw above this is possible for at most log™'”^ n points 
X G Xjf,. 

If a convex set S contains d opposite pairs, then it contains Xjn in its interior. Therefore, 
for small enough e, a convex set containing f{H) for a collection H of d opposite pairs 
containment-separates no pair P{H,x). 

To containment-separate P{H,x) a (convex) set must contain f{H), so the above bounds 
mean that (again, for small enough e) no convex set containment-separates more than 
Cm(i-i) log"^~^ n of the pairs P{H,x). Comparing this with the total number of tIPi) 

of the pairs P{H,x) shows that we need many convex sets to containment-separate the k- 
subsets of X. This finishes the proof of the lower bound in (Ilj) and with that the proof of 
Theorem |6l 


6. A REMARK: INTERSECTION-SEPARATION 

In the geometric setting, we discussed containment-separation. We can extend the notion 
of separation of points to fc-tuples in another way as well. We say that a set F intersection- 
separates a pair of /c-element subsets of X, if F intersects one of them and is disjoint from 
the other. A family P of subsets of X intersection-separates the fc-element subsets of X if, 
for any pair of fc-element subsets of X, there is a member of P that intersection-separates 
that pair. And thus, we can dehne the intersection-separation numbers as 




max min 

AcR'^,|X:|=n 
X in general pos. 




an intersection-separator of 
fc-subsets of X 


where denotes the family of convex subsets of 

The number if.{n) is always defined and at most n — 1 as the singleton subsets intersection- 
separate, even if we omit one of them. 

The following monotonicity properties can be verihed exactly as for containment-separation. 


( 5 ) 


and it{n) < 


12 ZS. lAnGI, M. NASZODI, J. pace, G. TARDOS, and G. TOTH 


Proposition 1. For any d 

we have 


( 6 ) 

( 7 ) 


2,3,... there is a constant q > 0 such that for any n > k > 2 


n + 3 

6 

n 


< i\{n) <n — l 


Cd . < itin) <n-l 

log n 


We have already mentioned the upper bounds. The lower bound in ([7j) can be proved by 
replacing each point of a Karolyi-Valtr construction (see Section [5]) by a pair of twins (two 
very close points). We provide a construction to show the lower bound in ([6]). 

By dS]) it is sufficient to consider the case k = 2. Suppose without loss of generality that 
n = 3m. We give the points in polar coordinates {r,(j)). Let £ > 0 be very small. For 
0 < i < m — 1, let p, be the point (1 — e, 2i7i/m), qi the point (1, 2m jm + £^), and r, the 
point (1, 2m/m — e^). Let X be the set of these n = 3m points. For any i, 0 < i < m — 1, 
consider the following pair of pairs: ({pi, {pi, ^*})- 

To hnish the proof, we claim that no convex set intersection-separates more than two 
of these m pairs of pairs. Indeed, suppose that a convex set K intersection-separates the 
pairs corresponding to the indices i, j and k. We may assume that the greatest angle of the 
triangle PiPjPk is at pj. Now, K contains one of g, and r*, one of qj and rj, and one of 
and Tfc. It is easy to see that, if £ is small enough, K contains pj, a contradiction. 
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